Towards Robust Deep Neural Networks for Affect and Depression Recognition from Speech
نویسندگان
چکیده
Intelligent monitoring systems and affective computing applications have emerged in recent years to enhance healthcare. Examples of these include assessment states such as Major Depressive Disorder (MDD). MDD describes the constant expression certain emotions: negative emotions (low Valence) lack interest Arousal). High-performing intelligent would diagnosis its early stages. In this paper, we present a new deep neural network architecture, called EmoAudioNet, for emotion depression recognition from speech. Deep EmoAudioNet learns time-frequency representation audio signal visual spectrum frequencies. Our model shows very promising results predicting affect depression. It works similarly or outperforms state-of-the-art methods according several evaluation metrics on RECOLA DAIC-WOZ datasets arousal, valence, Code is publicly available GitHub: https://github.com/AliceOTHMANI/EmoAudioNet.
منابع مشابه
Robust speech recognition with speech enhanced deep neural networks
We propose a signal pre-processing front-end to enhance speech based on deep neural networks (DNNs) and use the enhanced speech features directly to train hidden Markov models (HMMs) for robust speech recognition. As a comprehensive study, we examine its effectiveness for different acoustic features, acoustic models, and training-testing combinations. Tested on the Aurora4 task the experimental...
متن کاملFactored Deep Convolutional Neural Networks for Noise Robust Speech Recognition
In this paper, we present a framework of a factored deep convolutional neural network (CNN) learning for noise robust automatic speech recognition (ASR). Deep CNN architecture, which has attracted great attention in various research areas, has also been successfully applied to ASR. However, to ensure noise robustness, since merely introducing deep CNN architecture into the acoustic modeling of ...
متن کاملBinary Deep Neural Networks for Speech Recognition
Deep neural networks (DNNs) are widely used in most current automatic speech recognition (ASR) systems. To guarantee good recognition performance, DNNs usually require significant computational resources, which limits their application to low-power devices. Thus, it is appealing to reduce the computational cost while keeping the accuracy. In this work, in light of the success in image recogniti...
متن کاملDeep segmental neural networks for speech recognition
Hybrid systems which integrate the deep neural network (DNN) and hidden Markov model (HMM) have recently achieved remarkable performance in many large vocabulary speech recognition tasks. These systems, however, remain to rely on the HMM and assume the acoustic scores for the (windowed) frames are independent given the state, suffering from the same difficulty as in the previous GMM-HMM systems...
متن کاملTowards Robust Deep Neural Networks with BANG
Machine learning models, including state-of-the-art deep neural networks, are vulnerable to small perturbations that cause unexpected classification errors. This unexpected lack of robustness raises fundamental questions about their generalization properties and poses a serious concern for practical deployments. As such perturbations can remain imperceptible – commonly called adversarial exampl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-68790-8_1